Hybrid PSO and GA Models for Document Clustering

Author

  • K. Premalatha
Abstract

This paper presents hybrid Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) approaches for the document clustering problem. To obtain an optimal solution with a Genetic Algorithm, operations such as selection, reproduction, and mutation are used to generate the next generation. In this case the search may settle on a local solution, because chromosomes (individuals) that closely resemble one another can converge. In standard PSO, a non-oscillatory route can quickly cause a particle to stagnate, and the swarm may converge prematurely on suboptimal solutions that are not even guaranteed to be locally optimal. This work proposes hybrid models that enhance the search process by applying GA operations to stagnated particles and chromosomes. GA is combined with PSO to improve diversity and convergence toward the preferred solution for the document clustering problem. The efficiency of the approaches is verified and tested on a set of document corpora. Our results indicate that the approaches are a feasible alternative for solving document clustering problems.
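
The idea of applying GA operations to stagnated particles can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not specify the particle encoding, fitness function, stagnation test, or GA operators, so everything here is an assumption made for illustration. Each particle is assumed to encode `k` cluster centroids, fitness is assumed to be the mean Euclidean distance from each document vector to its nearest centroid, a particle is assumed to be stagnated after `patience` updates without improving its personal best, and the GA step is assumed to be uniform crossover with another particle's personal best followed by Gaussian mutation. The names `fitness`, `clone`, and `hybrid_pso_ga` are hypothetical.

```python
import math
import random


def fitness(centroids, docs):
    """Mean distance from each document vector to its nearest centroid (lower is better)."""
    return sum(min(math.dist(d, c) for c in centroids) for d in docs) / len(docs)


def clone(particle):
    """Copy a particle (a list of centroid vectors)."""
    return [c[:] for c in particle]


def hybrid_pso_ga(docs, k, n_particles=10, iters=40, patience=3, seed=0):
    rng = random.Random(seed)
    dim = len(docs[0])
    w, c1, c2 = 0.72, 1.49, 1.49  # assumed, commonly used PSO coefficients
    # Assumed encoding: a particle is k centroids, initialised from random documents.
    pos = [[list(rng.choice(docs)) for _ in range(k)] for _ in range(n_particles)]
    vel = [[[0.0] * dim for _ in range(k)] for _ in range(n_particles)]
    pbest = [clone(p) for p in pos]
    pbest_fit = [fitness(p, docs) for p in pos]
    stalled = [0] * n_particles  # updates since each personal best last improved
    g = min(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = clone(pbest[g]), pbest_fit[g]

    for _ in range(iters):
        for i in range(n_particles):
            if stalled[i] >= patience:
                # Assumed GA step for a stagnated particle: uniform crossover
                # with another personal best, then Gaussian mutation.
                mate = pbest[rng.randrange(n_particles)]
                pos[i] = [[(a if rng.random() < 0.5 else b) + rng.gauss(0.0, 0.1)
                           for a, b in zip(ca, cb)]
                          for ca, cb in zip(pbest[i], mate)]
                vel[i] = [[0.0] * dim for _ in range(k)]
                stalled[i] = 0
            else:
                # Standard PSO velocity and position update.
                for j in range(k):
                    for d in range(dim):
                        vel[i][j][d] = (w * vel[i][j][d]
                                        + c1 * rng.random() * (pbest[i][j][d] - pos[i][j][d])
                                        + c2 * rng.random() * (gbest[j][d] - pos[i][j][d]))
                        pos[i][j][d] += vel[i][j][d]
            f = fitness(pos[i], docs)
            if f < pbest_fit[i]:
                pbest[i], pbest_fit[i], stalled[i] = clone(pos[i]), f, 0
            else:
                stalled[i] += 1
            if pbest_fit[i] < gbest_fit:
                gbest, gbest_fit = clone(pbest[i]), pbest_fit[i]
    return gbest, gbest_fit
```

In practice the document vectors would typically be term-weight vectors and the distance measure might differ; calling `hybrid_pso_ga(docs, k=2)` on a list of equal-length numeric vectors returns the best centroids found and their fitness.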

Similar resources

FEASIBILITY OF PSO-ANFIS-PSO AND GA-ANFIS-GA MODELS IN PREDICTION OF PEAK GROUND ACCELERATION

In the present study, two new hybrid approaches are proposed for predicting the peak ground acceleration (PGA) parameter. The proposed approaches combine the Adaptive Neuro-Fuzzy System (ANFIS) with the Genetic Algorithm (GA) and with Particle Swarm Optimization (PSO). In these approaches, the PSO and GA algorithms are employed to enhance the accuracy of the ANFIS model. To develop hy...

Discrete PSO with GA Operators for Document Clustering

The paper presents a Discrete PSO algorithm for document clustering problems. The algorithm is a hybrid of PSO with GA operators. The proposed system is based on a population-based heuristic search technique, which can be used to solve combinatorial optimization problems, modeled on the concepts of cultural and social rules derived from the analysis of the swarm intelligence (PSO) with GA operators ...

Classification of Two Class Motor Imagery Tasks Using Hybrid GA-PSO Based K-Means Clustering

Transferring brain-computer interfaces (BCI) from laboratory conditions to real-world applications requires BCI to be applied asynchronously, without any time constraint. The high level of dynamism in the electroencephalogram (EEG) signal leads us to look toward evolutionary algorithms (EA). Motivated by these two facts, in this work a hybrid GA-PSO based K-means clustering technique has bee...

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays a key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing the semantic concepts of text has motivated researchers to use...

Prediction of Gain in LD-CELP Using Hybrid Genetic/PSO-Neural Models

In this paper, the gain in the LD-CELP speech coding algorithm is predicted using three neural models, which are equipped with genetic and particle swarm optimization (PSO) algorithms to optimize the structure and parameters of the neural networks. Elman, multi-layer perceptron (MLP), and fuzzy ARTMAP are the candidate neural models. The optimized number of nodes in the first and second hidden layers of El...

Publication date: 2010